aws: support for bring-your-own hosted zone #4772

Merged
openshift-merge-robot merged 4 commits into openshift:master from staebler:aws_byo_private_zone
Apr 11, 2021

Conversation

@staebler
Contributor

Add the .aws.hostedZone field to the install config to support the user supplying an existing hosted zone to serve as the cluster's internal private zone. This can only be used when the user is also supplying their own VPC. The hosted zone must already be associated with the user-provided VPC.
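
For illustration only, a minimal sketch of what the new field could look like on the AWS platform type. The field name comes from this PR; the surrounding struct and comments are assumptions, not the actual diff:

package aws

// Platform stores the AWS-specific parts of the install config
// (sketch; only the new field is shown).
type Platform struct {
	// ... existing fields such as Region and Subnets ...

	// HostedZone is the ID of an existing private hosted zone to use
	// for the cluster. It may only be set when the user also supplies
	// an existing VPC, and the zone must already be associated with
	// that VPC.
	// +optional
	HostedZone string `json:"hostedZone,omitempty"`
}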

Add validation in the "Platform Provisioning Check" asset for the user-provided internal hosted zone. The validation checks that
(1) the hosted zone is associated with the user-provided VPC and that (2) the hosted zone does not contain any record sets for subdomains of the cluster's domain.

The latter of these checks is meant to provide a modicum of protection against the user accidentally installing a second cluster over one that they have already installed. When it comes time to destroy the second, failed installation, the destroyer will not be able to tell that the record sets in the hosted zone are actually being used by a different cluster.

When the user provides the private hosted zone to use for the cluster, the destroyer still needs to delete the record sets for the cluster when the cluster is destroyed. There is no way to tag record sets, so we must rely on tagging the hosted zone as shared. When the destroyer encounters a hosted zone tagged as shared by the cluster, it will delete all record sets in that hosted zone that are strict subdomains of the cluster's domain. The cluster domain is added to the AWS cluster metadata so that it is available to the destroyer.
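
A rough sketch of the "strict subdomain" rule described above; the helper name and the trailing-dot normalization are assumptions, not the PR's actual code:

package main

import (
	"fmt"
	"strings"
)

// isStrictSubdomain reports whether a Route 53 record name is a strict
// subdomain of the cluster domain. Route 53 returns record names fully
// qualified with a trailing dot, so normalize before comparing. A name
// equal to the cluster domain itself (the zone's own NS/SOA records in
// the shared-zone case) is deliberately not a match.
func isStrictSubdomain(recordName, clusterDomain string) bool {
	name := strings.TrimSuffix(recordName, ".")
	domain := strings.TrimSuffix(clusterDomain, ".")
	return name != domain && strings.HasSuffix(name, "."+domain)
}

func main() {
	fmt.Println(isStrictSubdomain("api-int.mycluster.example.com.", "mycluster.example.com")) // true
	fmt.Println(isStrictSubdomain("mycluster.example.com.", "mycluster.example.com"))         // false
	fmt.Println(isStrictSubdomain("other.example.com.", "mycluster.example.com"))             // false
}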

https://issues.redhat.com/browse/CORS-1666

@staebler
Contributor Author

/test e2e-aws-shared-vpc

@openshift-ci-robot added the needs-rebase label Mar 24, 2021
@staebler force-pushed the aws_byo_private_zone branch from 6676aa8 to 5659a4f on March 29, 2021 14:14
@openshift-ci-robot removed the needs-rebase label Mar 29, 2021
@staebler force-pushed the aws_byo_private_zone branch from 5659a4f to 34aef82 on April 5, 2021 15:16
@staebler
Contributor Author

staebler commented Apr 5, 2021

5659a4fc1...34aef8216

@staebler force-pushed the aws_byo_private_zone branch from 34aef82 to fcf44b7 on April 7, 2021 18:13
@patrickdillon left a comment
Contributor

I have reviewed the first three commits and will take a closer look at the destroy code soon.

Contributor

nit: if -> is

Contributor

I don't get why we continue here. It seems like it is ok to have a record equal to the cluster domain. Is that the case?

Contributor Author

Yes, every hosted zone has two records, an NS record and an SOA record, with the same domain as the hosted zone. I will add a comment about this.

Contributor

Is `invalid` the right type of error here? Perhaps `InternalError` would be more suitable?

Contributor Author

Yep, that makes sense.
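
For reference, a hedged sketch of the distinction, assuming the k8s.io/apimachinery/pkg/util/validation/field helpers the installer uses for validation; route53Client, GetHostedZone, and AssociatedWithVPC here are hypothetical stand-ins, not real API names:

// validateHostedZone illustrates the two error flavors (sketch only).
func validateHostedZone(fldPath *field.Path, zoneID string, client route53Client) field.ErrorList {
	allErrs := field.ErrorList{}
	zone, err := client.GetHostedZone(zoneID)
	if err != nil {
		// The lookup itself failed: our machinery, not the user's input.
		return append(allErrs, field.InternalError(fldPath, err))
	}
	if !zone.AssociatedWithVPC {
		// The user's input is wrong: Invalid is the right flavor here.
		allErrs = append(allErrs, field.Invalid(fldPath, zoneID, "hosted zone is not associated with the VPC"))
	}
	return allErrs
}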

Comment thread pkg/destroy/aws/aws.go Outdated
Contributor

nit: comment does not match function name

@patrickdillon left a comment
Contributor

This looks mostly sane to me, with a few nits and a couple of questions, especially regarding the trailing dot. Overall it looks good. I may take another, closer look, as I don't have a lot of background in the AWS destroy code.

Comment thread pkg/destroy/aws/aws.go Outdated
Contributor

nit: I'm not sure + in the verb %#+v is doing anything. From fmt:

%v	the value in a default format
	when printing structs, the plus flag (%+v) adds field names
%#v	a Go-syntax representation of the value

https://play.golang.org/p/-LjA46_hvi3
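
A quick runnable illustration of the same point (hypothetical struct; the comments show what each verb prints):

package main

import "fmt"

type recordSet struct {
	Name string
	Type string
}

func main() {
	rs := recordSet{Name: "api-int.example.com.", Type: "A"}
	fmt.Printf("%v\n", rs)   // {api-int.example.com. A}
	fmt.Printf("%+v\n", rs)  // {Name:api-int.example.com. Type:A}
	fmt.Printf("%#v\n", rs)  // main.recordSet{Name:"api-int.example.com.", Type:"A"}
	fmt.Printf("%#+v\n", rs) // same as %#v; the + flag adds nothing once # is set
}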

Contributor Author

I agree. This was copied from

lastError = errors.Wrapf(err, "deleting record set %#+v from zone %s", recordSet, id)

I will remove it here.

Comment thread pkg/destroy/aws/shared.go Outdated
Comment on lines 23 to 37
Contributor

nit (or maybe more of a rant): I realize (now) this code is not original to this PR, but this seems like a convoluted way of doing:

key := "kubernetes.io/cluster/" + clusterID
o.removeSharedTag(ctx, session, tagClients, key, tracker)

Perhaps the existing code is more future-proof. Let's not fix what's not broken; just ranting, or maybe I'm missing something.

Contributor Author

Yes, this is a bit of code that takes me too long to understand every time I look at it. I don't know that we want to rely on the clusterID being set correctly, although I can't think of a good reason why it wouldn't be.

Maybe it would be sufficient to have a function that gets the cluster tag keys from the filters.

func clusterOwnedKeys(filters []Filter) []string {
	var keys []string
	for _, filter := range filters {
		for key, value := range filter {
			if !strings.HasPrefix(key, "kubernetes.io/cluster/") {
				continue
			}
			if value != "owned" {
				continue
			}
			keys = append(keys, key)
		}
	}
	return keys
}

And then the removeSharedTags function could be the following.

func removeSharedTags(ctx context.Context, tagClients []*resourcegroupstaggingapi.ResourceGroupsTaggingAPI, filters []Filter, logger logrus.FieldLogger) error {
	for _, key := range clusterOwnedKeys(filters) {
		if err := removeSharedTag(ctx, tagClients, key, logger); err != nil {
			return err
		}
	}
	return nil
}
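
A hypothetical call site in the destroyer's cleanup path, assuming the options struct already carries the filters and logger (the field names here are illustrative, not the PR's actual code):

if err := removeSharedTags(ctx, tagClients, o.Filters, o.Logger); err != nil {
	return errors.Wrap(err, "failed to remove shared tags")
}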

Comment thread pkg/destroy/aws/shared.go Outdated
Contributor

Should this be capitalized: nothing -> Nothing?
Or is that not necessary because it is WithField?

Contributor Author

It should be capitalized.

Comment thread pkg/destroy/aws/shared.go Outdated
Contributor

Same question about caps: no -> No

Comment thread pkg/destroy/aws/shared.go Outdated
Contributor

Is the name of the record set dotted, or is it the value of the record that is dotted? When I look in the GUI, it looks like the name isn't dotted but the value is.

Contributor Author

The name of the record set is dotted.
For example,

        {
            "Name": "api-int.ewolinetz3.devcluster.openshift.com.",
            "Type": "A",
            "AliasTarget": {
                "HostedZoneId": "ZLMOA37VPKANP",
                "DNSName": "ewolinetz3-rpcfk-int-42decb5de56b9b0b.elb.us-east-2.amazonaws.com.",
                "EvaluateTargetHealth": false
            }
        },

Comment thread pkg/destroy/aws/shared.go Outdated
Contributor

same comment as above about %#+v verb

Comment thread pkg/destroy/aws/shared.go Outdated
Contributor

same comment as above about %#+v verb

Comment thread pkg/destroy/aws/shared.go Outdated
Contributor

Should we add in the error message for the public record that we skipped deleting the private record?

Contributor Author

No. The destroyer should try again later to delete the records.

staebler added 3 commits April 8, 2021 16:53
Add the `.aws.hostedZone` field to the install config to support
the user supplying an existing hosted zone for the internal
private hosted zone for the cluster. This can only be used when
the user is also supplying their own VPC. The hosted zone must
already be associated with the user-provided VPC.

https://issues.redhat.com/browse/CORS-1666
When the user provides an existing hosted zone to use for the
cluster's private zone, the dns.config.openshift.io resource
needs to be adjusted so that the operator can locate the
hosted zone. Since the hosted zone already exists, we can specify
the ID of the hosted zone rather than relying on tags.
Add validation in the "Platform Provisioning Check" asset for the
user-provided internal hosted zone. The validation checks that
(1) the hosted zone is associated with the user-provided VPC and that
(2) the hosted zone does not contain any record sets for subdomains
of the cluster's domain.

The latter of these checks is meant to provide a modicum of protection
against the user accidentally installing a second cluster over one
that they have already installed. When it comes time to destroy the
second, failed installation, the destroyer will not be able to tell
that the record sets in the hosted zone are actually being used by a
different cluster.
@staebler force-pushed the aws_byo_private_zone branch from fcf44b7 to 9bc4190 on April 8, 2021 21:10
@staebler
Contributor Author

staebler commented Apr 8, 2021

@patrickdillon I have addressed your feedback.

When the user provides the private hosted zone to use for the cluster,
the destroyer still needs to delete the record sets for the cluster
when the cluster is destroyed. There is no way to tag record sets, so
we must rely on tagging the hosted zone as shared. When the destroyer
encounters a hosted zone tagged as shared by the cluster, the destroyer
will delete all record sets in that hosted zone that are strict
subdomains of the cluster's domain. The cluster domain is added to the
AWS cluster metadata so that it is available to the destroyer.
@staebler force-pushed the aws_byo_private_zone branch from 9bc4190 to cf18b17 on April 8, 2021 21:21
@patrickdillon
Contributor

I'm having a hard time reasoning through this destroy code in the context of avoiding deleting non-cluster records. I may also be hitting some limits in my knowledge.

I need to revisit the code & will do that immediately, but we're in a bit of a time crunch, so I want to write this out to aid the process/discussion.

What about this scenario (I think this would pass pre-provision check, but let me know if something else makes this invalid):

We have a shared hosted zone: shared.devcluster.openshift.com

We install a cluster with the cluster domain shared.devcluster.openshift.com

We install a second cluster with the cluster domain foo.shared.devcluster.openshift.com

We delete the first cluster, with cluster domain shared.devcluster.openshift.com. Doesn't that delete the records for the second cluster?

If so, I am thinking we should lean toward deleting whitelisted records in this shared scenario (e.g. only delete api + cluster domain, etc.). I'm sure you've considered this, and I imagine the downside is that if the cluster creates non-whitelisted records they will be leaked, but that's not the case at the moment, right?

@patrickdillon
Contributor

/lgtm

@openshift-ci-robot added the lgtm label Apr 9, 2021
@staebler
Contributor Author

staebler commented Apr 9, 2021

For posterity, since this was discussed outside of this PR,

> I'm having a hard time reasoning through this destroy code in the context of avoiding deleting non-cluster records. I may also be hitting some limits in my knowledge.
>
> I need to revisit the code & will do that immediately, but we're in a bit of a time crunch, so I want to write this out to aid the process/discussion.
>
> What about this scenario (I think this would pass pre-provision check, but let me know if something else makes this invalid):
>
> We have a shared hosted zone: shared.devcluster.openshift.com
>
> We install a cluster with the cluster domain shared.devcluster.openshift.com
>
> We install a second cluster with the cluster domain foo.shared.devcluster.openshift.com
>
> We delete the first cluster, with cluster domain shared.devcluster.openshift.com. Doesn't that delete the records for the second cluster?

Yes, that will delete the records for the second cluster. The contention here is that it is a misconfiguration for the second cluster to use a cluster domain that is a subdomain of the first cluster's domain. The first cluster owns its entire cluster domain. We cannot check for this at pre-provision time because there is no way to (reliably) tell whether a cluster with a parent domain already exists in the hosted zone.

> If so, I am thinking we should lean toward deleting whitelisted records in this shared scenario (e.g. only delete api + cluster domain, etc.). I'm sure you've considered this, and I imagine the downside is that if the cluster creates non-whitelisted records they will be leaked, but that's not the case at the moment, right?

A whitelist is problematic because record sets can be created by in-cluster components with names that the installer does not know about. The installer only creates the api and api-int record sets. For example, the *.apps record set is created in-cluster.

@staebler
Contributor Author

staebler commented Apr 9, 2021

/approve

@openshift-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: staebler

The full list of commands accepted by this bot can be found here.

The pull request process is described here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci-robot added the approved label Apr 9, 2021
@openshift-bot
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

@openshift-bot
Contributor

/retest

Please review the full test history for this PR and help us cut down flakes.

7 similar comments

@openshift-ci
Contributor

openshift-ci Bot commented Apr 11, 2021

@staebler: The following test failed, say /retest to rerun all failed tests:

Test name            Commit    Details    Rerun command
ci/prow/e2e-libvirt  cf18b17   link       /test e2e-libvirt

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

@staebler
Contributor Author

/cherry-pick release-4.7

@openshift-cherrypick-robot

@staebler: #4772 failed to apply on top of branch "release-4.7":

Applying: aws: support for bring-your-own hosted zone
Applying: assets: use aws byo hosted zone in dns config
Applying: aws: add provision checks for byo hosted zone
Using index info to reconstruct a base tree...
M	pkg/asset/installconfig/aws/validation.go
M	pkg/asset/installconfig/platformprovisioncheck.go
Falling back to patching base and 3-way merge...
Auto-merging pkg/asset/installconfig/platformprovisioncheck.go
CONFLICT (content): Merge conflict in pkg/asset/installconfig/platformprovisioncheck.go
Auto-merging pkg/asset/installconfig/aws/validation.go
error: Failed to merge in the changes.
hint: Use 'git am --show-current-patch=diff' to see the failed patch
Patch failed at 0003 aws: add provision checks for byo hosted zone
When you have resolved this problem, run "git am --continue".
If you prefer to skip this patch, run "git am --skip" instead.
To restore the original branch and stop patching, run "git am --abort".

In response to this:

/cherry-pick release-4.7

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files.
lgtm Indicates that a PR is ready to be merged.

6 participants